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SEQUENCE LISTING 

<110> institut National de la Sante et de la Recherche M 

<120> A polypeptide derived from gp41, a vaccine composition 
comprising said polypeptide, and uses for treating an infection 
by an HIV virus in an individual 

<130> P852-PCT2-INSERM 

<140> 
<141> 

<160> 4 

<170> Patentin Ver. 2.1 

<210> 1 

<211> 1168 

<212> PRT 

<213> Homo sapiens 

<400> 1 

Met Ser lie val lie Pro Leu Gly val Asp Thr Ala Glu Thr Ser Tyr 
15 10 15 

Leu Glu Met Ala Ala Gly ser Glu Pro Glu ser val Glu Ala ser Pro 
20 25 30 

val val Val Glu Lys Ser Asn Ser Tyr Pro His Gin Leu Tyr Thr ser 
35 40 45 . 

Ser Ser His His Ser His ser Tyr lie Gly Leu Pro Tyr Ala Asp His 
50 55 60 

Asn Tyr Gly Ala Arg Pro Pro Pro Thr Pro Pro Ala ser Pro Pro Pro 
65 70 75 80 

Ser val Leu lie Ser Lys Asn Glu Val Gly lie Phe Thr Thr Pro Asn 
85 90 95 

Phe Asp Glu Thr Ser ser Ala Thr Thr lie Ser Thr Ser Glu Asp Gly 
100 105 110 

Ser Tyr Gly Thr Asp val Thr Arg cys He Cys Gly Phe Thr His Asp 

.115 120 125 

Asp Gly Tyr Met lie cys cys Asp Lys cys Ser val Trp Gin His lie 
130 135 140 

Asp Cys Met Gly lie Asp Arg Gin His lie Pro Asp Thr Tyr Leu cys 

145 150 155 160 

Glu Arg Cys Gin Pro Arg Asn Leu Asp Lys Glu Arg Ala val Leu Leu 
165 170 175 

Gin Arg Arg Lys Arg Glu Asn Met Ser Asp Gly Asp Thr ser Ala Thr 
180 185 190 

Glu ser Gly Asp Glu val Pro val Glu Leu Tyr Thr Ala Phe Gin His 
195 200 205 

Thr Pro Thr ser lie Thr Leu Thr Ala Ser Arg val ser Lys val Asn 

210 215 220 

Asp Lys Arg Arg Lys Lys Ser Gly Glu Lys Glu Gin His lie ser Lys 
225 230 235 240 
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Cys Lys Lys Ala Phe Arg Glu Gly Ser Arg Lys ser Ser Arg Val Lys 
245 250 255 

Gly ser Ala Pro Glu lie Asp Pro Ser ser Asp Gly ser Asn Phe Gly 

260 265 270 

Trp Glu Thr Lys lie Lys Ala Trp Met Asp Arg Tyr Glu Glu Ala Asn 
275 280 285 

Asn Asn Gin Tyr ser Glu Gly val Gin Arg Glu Ala Gin Arg lie Ala 
290 295 300 

Leu Arg Leu Gly Asn Gly Asn Asp Lys Lys Glu Met Asn Lys Ser Asp 
305 310 315 320 

Leu Asn Thr Asn Asn Leu Leu Phe Lys pro pro Val Glu ser His lie 
325 330 335 

Gin Lys Asn Lys Lys lie Leu Lys ser Ala Lys Asp Leu pro Pro Asp 
340 345 350 

Ala Leu lie lie Glu Tyr Arg Gly Lys Phe Met Leu Arg Glu Gin Phe 
355 360 365 

Glu Ala Asn Gly Tyr Phe Phe Lys Arg Pro Tyr Pro Phe Val Leu Phe 
370 375 380 

Tyr ser Lys Phe His Gly Leu Glu Met cys val Asp Ala Arg Thr Phe 
385 390 395 400 

Gly Asn Glu Ala Arg Phe lie Arg Arg Ser cys Thr Pro Asn Ala Glu 
405 410 415 

val Arg His Glu lie Gin Asp Gly Thr lie His Leu Tyr lie Tyr ser 
420 425 430 

lie His Ser He Pro Lys Gly Thr Glu lie Thr He Ala Phe Asp Phe 
435 440 445 

Asp Tyr Gly Asn Cys Lys Tyr Lys val Asp cys Ala cys Leu Lys Glu 
450 455 * 460 

Asn Pro Glu cys Pro val Leu Lys Arg Ser Ser Glu ser Met Glu Asn 
465 470 475 480 

lie Asn ser Gly Tyr Glu Thr Arg Arg Lys Lys Gly Lys Lys Asp Glu 
485 490 495 

Asp lie Ser Lys Glu Lys Asp Thr Gin Asn Gin Asn He Thr Leu Asp 
500 505 510 

cys Glu Gly Ala Thr Asn Lys Met Lys ser Pro Glu Thr Lys Gin Arg 
515 520 525 

Lys Leu Ser Pro Leu Arg Leu ser Val ser Asn Asn Gin Glu Pro Asp 
530 535 540 

Phe lie Asp Asp lie Glu Glu Lys Thr Pro lie Ser Asn Glu Val Glu 
545 550 555 560 

Met Glu ser Glu Glu Gin He Ala Glu Arg Lys Arg Lys Met Thr Arg 

565 570 ' 575 

Glu Glu Arg Lys Met Glu Ala He Leu Gin Ala Phe Ala Arg Leu Glu 
580 585 590 

Lys Arg Glu Lys Arg Arg Glu Gin Ala Leu Glu Arg He Ser Thr Ala 
595 600 605 
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Lys Thr Glu Val Lys Thr Glu cys Lys Asp Thr Gin lie Val Ser Asp 
610 615 620 

Ala Glu Val lie Gin Glu Gin Ala Lys Glu Glu Asn Ala Ser Lys Pro 
625 630 635 640 

Thr Pro Ala Lys val Asn Arg Thr Lys Gin Arg Lys Ser Phe ser Arg 
645 650 655 

Ser Arg Thr His lie Gly Gin Gin Arg Arg Arg His Arg Thr val Ser 
660 665 670 

Met cys Ser Asp lie Gin Pro ser ser Pro Asp lie Glu val Thr Ser 

675 680 685 

Gin Gin Asn Asp He Glu Asn Thr Val Leu Thr He Glu Pro Glu Thr 
690 695 700 

Glu Thr Ala Leu Ala Glu lie lie Thr Glu Thr Glu Val Pro Ala Leu 
705 710 715 720 

Asn Lys cys Pro Thr Lys Tyr Pro Lys Thr Lys Lys His Leu Val Asn 
725 730 735 

Glu Trp Leu ser Glu Lys Asn Glu Lys Thr Gly Lys Pro ser Asp Gly 
740 745 750 

Leu Ser Glu Arg Pro Leu Arg lie Thr Thr Asp Pro Glu Val Leu Ala 
755 760 765 

Thr Gin Leu Asn ser Leu Pro Gly Leu Thr Tyr Ser Pro His Val Tyr 

770 775 780 

ser Thr Pro Lys His Tyr lie Arg Phe Thr Ser Pro Phe Leu Ser Glu 
785 790 795 800 

Lys Arg Arg Arg Lys Glu Pro Thr Glu Asn He Ser Gly Ser Cys Lys 
805 810 815 

Lys Arg Trp Leu Lys Gin Ala Leu Glu Glu Glu Asn Ser Ala lie Leu 
820 825 830 

His Arg Phe Asn ser Pro cys Gin Glu Arg Ser Arg Ser Pro Ala Val 
835 840 845 

Asn Gly Glu Asn Lys Ser Pro Leu Leu Leu Asn Asp ser cys ser Leu 
850 855 860 

Pro Asp Leu Thr Thr Pro Leu Lys Lys Arg Arg Phe Tyr Gin Leu Leu 
865 870 875 880 

Asp ser Val Tyr ser Glu Thr Ser Thr Pro Thr Pro Ser Pro Tyr Ala 
885 890 895 

Thr Pro Thr His Thr Asp He Thr Pro Met Asp Pro ser Phe Ala Thr 
900 905 910 

pro Pro Arg lie Lys Ser Asp Asp Glu Thr Cys Arg Asn Gly Tyr Lys 
915 920 ^ ^ 925 

Pro He Tyr ser Pro Val Thr Pro Val Thr Pro Gly Thr Pro Gly Asn 
930 935 940 

Thr Met His Phe Glu Asn He Ser Ser Pro Glu ser Ser Pro Glu He 
945 950 955 960 

Lys Arg Arg Thr Tyr Ser Gin Glu Gly Tyr Asp Arg Ser Ser Thr Met 
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965 970 975 

Leu Thr Leu Gly Pro Phe Arg Asn Ser Asn Leu Thr Glu Leu Gly Leu 
980 985 990 

Gin Glu lie Lys Thr lie Gly Tyr Thr Ser Pro Arg Ser Arg Thr Glu 
995 1000 1005 

Val Asn Arg Gin Cys Pro Gly Glu Lys Glu Pro Val ser Asp Leu Gin 
1010 1015 1020 

Leu Gly Leu Asp Ala Val Glu Pro Thr Ala Leu His Lys Thr Leu Glu 
1025 1030 1035 1040 

Thr Pro Ala His Asp Arg Ala Glu Pro Asn Ser Gin Leu Asp Ser Thr 
1045 1050 1055 

His ser Gly Arg Gly Thr Met Tyr Ser ser Trp Val Lys Ser Pro Asp 
1060 1065 1070 

Arg Thr Gly Val Asn Phe Ser Val Asn Ser Asn Leu Arg Asp Leu Thr 
1075 1080 1085 

Pro ser His Gin Leu Glu Val Gly Gly Gly Phe Arg lie ser Glu Ser 

1090 1095 ^ 1100 

Lys Cys Leu Met Gin Asp Asp Thr Arg Gly Met Phe Met Glu Thr Thr 
1105 1110 1115 1120 

Val Phe Cys Thr Ser Glu Asp Gly Leu val Ser Gly Phe Gly Arg Thr 
1125 1130 1135 

val Asn Asp Asn Leu lie Asp Gly Asn cys Thr Pro Gin Asn Pro Pro 
1140 1145 1150 

Gin Lys Lys Lys ser Pro Val Gly Asn Phe Val Gly Ser Asn Val Val 
1155 1160 1165 



<210> 2 
<211> 262 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Ala Trp Arg Ala Leu His His Trp Leu Leu Leu Leu Leu Phe Pro 
15 10 15 

Gly ser Gin Ala Gin ser Lys Ala Gin val Leu Gin ser Val Ala Gly 
20 25 30 

Gin Thr Leu Thr val Arg Cys Gin Tyr Pro Pro Thr Gly ser Leu Tyr 
35 40 45 

Glu Lys Lys Gly Trp Cys Lys Glu Ala Ser Ala Leu val cys lie Arg 
50 55 60 

Leu Val Thr ser Ser Lys Pro Arg Thr Met Ala Trp Thr Ser Arg phe 
65 70 75 80 

Thr He Trp Asp Asp Pro Asp Ala Gly Phe Phe Thr val Thr Met Thr 
85 90 95 

Asp. Leu Arg Glu Glu Asp Ser Gly His Tyr Trp Cys Arg lie Tyr Arg 
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100 

Pro Ser Asp Asn Ser Val 
115 



li stage sequences P852PCT2 
105 110 

Ser Lys ser val Arg Phe Tyr Leu Val val 
120 125 



ser Pro Ala Ser Ala ser Thr Gin Thr Pro Trp Thr Pro Arg Asp Leu 
130 135 140 

Val Ser Ser Gin Thr Gin Thr Gin ser Cys val Pro Pro Thr Ala Glv 
145 150 155 160 

Ala Arg Gin Ala Pro Glu Ser Pro ser Thr lie Pro Val Pro Ser Gin 
165 170 175 

Pro Gin Asn Ser Thr Leu Arg Pro Gly Pro Ala Ala Pro He Ala Leu 
180 185 190 

val Pro Val Phe Cys Gly Leu Leu val Ala Lys ser Leu val Leu ser 
195 200 205 

Ala Leu Leu Val Trp Trp Gly Asp lie Trp Trp Lys Thr val Met Glu 
210 215 220 

Leu Arg Ser Leu Asp Thr Gin Lys Ala Thr cys His Leu Gin Gin val 
225 230 235 240 

Thr Asp Leu Pro Trp Thr ser Val ser ser Pro val Glu Arg Glu He 
245 250 255 

Leu Tyr His Thr Val Ala 
260 



<210> 3 
<211> 3507 
<212> DNA 

<213> Homo sapiens 

<400> 3 
atgagcatag 
gcaggttcag 
tatccccacc 
tatgcggacc 
tcagtcctta 
tccagtgcta 
tgcatatgtg 
tggcaacata 
gaacgttgtc 
agggaaaata 
gaattatata 
tccaaagtta 
tgtaaaaagg 
gagattgatc 
atggatcgat 
caaagaatag 
ttgaatacca 
aaaattctta 
aagtttatgc 
tttgtgttat 
gggaatgagg 
attcaagatg 
gaaattacta 
tgcctcaaag 
atcaatagtg 
gaaaaagata 
aagagcccag 
caggaaccag 



tgatcccatt 
aaccagaatc 
agttatatac 
ataattatgg 
ttagcaaaaa 
ctacaatcag 
gttttacaca 
ttgactgcat 
agcctaggaa 
tgtcagatgg 
ctgcatttca 
atgataaaag 
catttcgtga 
cttcatctga 
atgaagaagc 
ctctgagatt 
acaatttgct 
aatctgcaaa 
tgagagaaca 
tctactctaa 
ctcgattcat 
gaaccataca 
ttgcctttga 
aaaacccaga 
gttatgagac 
cacaaaatca 
aaactaaaca 
attttattga 



gggggttgat 

cgtagaagct 
cagcagctca 
tgctcgtcct 
tgaagtaggc 
cacatctgag 
tgatgatgga 
ggggattgat 
tttggataaa 
tgataccagt 
gcatactcca 
aaggaaaaaa 
aggatctagg 
tggttcaaat 
aaataacaac 
aggcaatgga 
cttcaaacct 
agatttgcct 
gtttgaagca 
atttcatggg 
caggcggtct 
tctttatatt 
ttttgactat 
gtgccctgtt 
cagacggaaa 
gaatattact 
aagaaagctt 
tgatatagaa 



acagcagaga 
agccctgtgg 
catcattcac 
cctccgacac 
atatttacca 
gatggaagtt 
tacatgatct 
aggcagcata 
gagagggcag 
gcaactgaga 
acatcaatta 
agcggggaga 
aagtcatcaa 
tttggatggg 
cagtatagtg 
aatgacaaaa 
cctgtagaga 
cctgatgcac 
aatgggtatt 
ctagaaatgt 
tgtacaccca 
tattctatac 
ggaaattgta 
ctaaaacgta 
aaaggaaaaa 
ttggattgtg 
tctccactga 
gaaaaaactc 



cgtcatactt 
tagttgagaa 
acagttacat 
ctccggcttc 
ctcctaattt 
atggtactga 
gttgtgacaa 
ttcctgatac 
tgctactaca 
gtggtgatga 
ctttaactgc 
aagaacaaca 
gagttaaggg 
agacaaagat 
agggtgttca 
aagagatgaa 
gccatataca 
ttatcattga 
tctttaaaag 
gtgttgatgc 
atgcagaggt 
acagtattcc 
agtacaaggt 
gttctgaatc 
aagacgaaga 
aaggagcgac 
gactatcagt 
ctattagtaa 



ggaaatggct 
atccaacagt 
tggtttgccc 
ccctcctcca 
tgatgaaact 
tgtaaccagg 
atgcagcgtt 
atatctatgt 
acgccggaaa 
ggttcctgtg 
ttcaagagtt 
catttcaaaa 
ttcagctcca 
caaagcatgg 
gagggaggca 
taaatccgat 
aaagaataag 
atacagaggg 
accataccct 
aaggactttt 
gaggcatgaa 
aaagggaact 
ggactgtgca 
catggaaaat 
tatttcaaaa 
caacaaaatg 
atcaaataat 
tgaagtagaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1250 

1320 

1380 

1440 

1500 

1560 

1620 

1680 
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atggaatcag aggagcagat tgcagaaagg aaaaggaaga tgacaagaga agaaagaaaa 1740 
atggaagcaa ttttgcaagc ttttgccaga cttgaaaaaa gagagaaaag aagagaacaa 1800 
gctttggaaa ggatcagcac agccaaaact gaagttaaaa ctgaatgtaa agatacacag 1860 
attgtcagtg atgctgaagt tattcaggaa caagcaaaag aagaaaatgc tagcaagcca 1920 
acccctgcca aagtaaatag aactaaacag agaaaaagtt tttctcggag taggactcac 1980 
attggacagc agcgtcggag acacagaact gtcagcatgt gttcagatat ccagccatct 2040 
tctcctgata tagaagttac ttcacaacaa aatgatattg aaaatactgt acttacaata 2100 
gaaccagaaa ctgaaactgc actagcagaa ataattactg aaactgaagt tccagcactt 2160 
aataaatgtc ctaccaagta ccccaaaaca aagaagcact tggttaatga atggttaagt 2220 
gagaagaatg agaagacagg aaaaccttca gatggccttt cagaaaggcc tctacgcata 2280 
actacagatc ctgaagtgtt agctacacaa ctcaattctt taccaggtct cacttacagc 2340 
ccccatgtat actccactcc taagcattat attagattta cttcaccatt cctttcagaa 2400 
aaaaggagaa gaaaagaacc tactgaaaac atttctggtt catgcaagaa gcgatggttg 2460 
aaacaagctc tggaagaaga aaattcagca attttacata gatttaattc accctgtcaa 2520 
gaaagatcca gaagtcctgc agtcaatggt gaaaataaaa gtccactact attaaatgac 2580 
agctgttccc ttccagattt aactacacca ctaaaaaaac gaagatttta tcagttgcta 2640 
gattcggttt actcagaaac ctccacacct actccttccc cgtatgctac accaactcac 2700 
accgatatta ctcctatgga cccatctttt gccacgcctc cacggataaa atcagatgat 2760 
gaaacttgta gaaatggtta taaacccata tattcaccag ttaccccagt aactcctggt 2820 
acaccaggaa ataccatgca ctttgagaat atttcttccc cagaaagttc tccagaaata 2880 
aagagacgca cttatagtca agagggatat gacagatctt caaccatgtt aacattgggg 2940 
ccttttagaa attctaattt aactgaactg ggtctgcaag aaataaagac tattggttat 3000 
acgagcccta ggagtaggac tgaagtcaac aggcagtgtc ctggagaaaa ggaacctgtg 3060 
tcagaccttc agctaggact cgatgcagtt gagccaactg ccctacataa aaccctggaa 3120 
acgcctgcac atgacagggc tgagcccaac agccaactgg actcgactca ctctggacgg 3180 
ggcacaatgt attcttcctg ggtaaagagc cctgacagaa caggagttaa cttctcagtg 3240 
aactccaact tgagggacct gacaccctcg catcagttgg aggttggagg aggcttccga 3300 
ataagtgagt caaagtgcct gatgcaggat gatactagag gcatgtttat ggaaacaact 3360 
gtgttttgta cttccgaaga tgggcttgta tctggtttcg gacggactgt taatgacaat 3420 
ttgatcgacg ggaattgcac accccagaat ccaccacaaa agaaaaagag tccagttggc 3480 
aactttgtgg gaagcaatgt agtatag , 3507 



<210> 4 
<211> 787 
<212> DNA 

<213> Homo sapiens 
<400> 4 

atggcctggc gagccctaca ccactggcta ctgctgctgc tgttcccagg 
caatccaagg ctcaggtact tcaaagtgtg gcagggcaga cgctaaccgt 
tacccgccca cgggcagtct ctacgagaag aaaggctggt gtaaggaggc 
gtgtgcatca ggttagtcac cagctccaag cccaggacga tggcttggac 
acaatctggg acgaccctga tgctggcttc ttcactgtca ceatgactga 
gaagactcag gacattactg gtgtagaatc taccgccctt ctgacaactc 
tccgtcagat tctatctggt ggtatctcca gcctctgcct ccacacagac 
ccccgcgacc tggtctcttc acagacccag acccagagct gtgtgcctcc 
gcGagacaag cccctgagtc tccatctacc atccctgtcc cttctcagcc 
acgctccgcc ctggccctgc agcccccatt gccctggtgc ctgtgttctg 
gtagccaaga gcctggtgct gtcagccctg ctcgtctggt ggggggacat 
accgtgatgg agctcaggag cctggatacc caaaaagcca cctgccacct 
acggaccttc cctggacctc agtttcctca cctgtagaga gagaaatatt 
gttgcaa 



ctctcaggca 60 
gagatgccag 120 
ttcagcactt 180 
ctctcgattc 240 
tctaagagag 300 
tgtctctaag 360 
cccctggact 420 
cactgcagga 480 
acagaactcc 540 
tggactcctc 600 
atggtggaaa 660 
tcaacaggtc 720 
atatcacact 780 
787 



